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Abstract 



We count the number of occurrences of certain patterns in given words. We choose these words to be 
the set of all finite approximations of a sequence generated by a morphism with certain restrictions. 
The patterns in our considerations are either classical patterns 1-2, 2-1, 1-1- or arbitrary gen- 

eralized patterns without internal dashes, in which repetitions of letters are allowed. In particular, 
we find the number of occurrences of the patterns 1-2, 2-1, 12, 21, 123 and 1-1- • • --1 in the words 
obtained by iterations of the morphism 1 — > 123, 2 — > 13, 3 — > 2, which is a classical example of a 
morphism generating a nonrepetitive sequence. 



1. Introduction and Background 



We write permutations as words 7r = a\a2 ■ ■ ■ a„, whose letters are distinct and usually consist of the 
integers 1, 2, . . . , n. 

An occurrence of a pattern p in a permutation 7r is "classically" defined as a subsequence in 7r (of the 
same length as the length of p) whose letters are in the same relative order as those in p. Formally 
speaking, for r < n, we say that a permutation a in the symmetric group S n has an occurrence of the 
pattern p G S r if there exist 1 < i± < 12 < • • ■ < i r < n such that p = o~{i\)cr(i2) ■ ■ ■ o~(i r ) in reduced 
form. The reduced form of a permutation a on a set {ji, J2, ■ • ■ , jr}, where j± < j% < ■• ■ < j r , is 
a permutation o~\ obtained by renaming the letters of the permutation a so that ji is renamed i for 
all i € {1, . . . , r}. For example, the reduced form of the permutation 3651 is 2431. The first case of 
classical patterns studied was that of permutations avoiding a pattern of length 3 in 1S3 . Knuth [ Knuth| 



found that, for any r 6 5 3 , the number |5„(r)| of n-permutations avoiding r is C„, the nth Catalan 



number. Later, Simion and Schmidt [SimSch] determined the number |<S„(P)| of permutations in S n 
simultaneously avoiding any given set of patterns P Q S3. 



In |BabStein] Babson and Steingn'msson introduced generalised permutation patterns that allow the 
requirement that two adjacent letters in a pattern must be adjacent in the permutation. In order to 
avoid confusion we write a "classical" pattern, say 231, as 2-3-1, and if we write, say 2-31, then we mean 
that if this pattern occurs in the permutation, then the letters in the permutation that correspond 
to 3 and 1 are adjacent. For example, the permutation ir = 516423 has only one occurrence of the 
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pattern 2-31, namely the subword 564, whereas the pattern 2-3-1 occurs, in addition, in the subwords 



562 and 563. A motivation for introducing these patterns in [BabStein was the study of Mahonian 
statistics. A number of interesting results on generalised patterns were obtained in |Claes | . Relations 
to several well studied combinatorial structures, such as set partitions, Dyck paths, Motzkin paths 
and involutions, were shown there. 



Burstein [Burstcin] considered words instead of permutations. In particular, he found the number 
|[£;] ra (P)| of words of length n in k- letter alphabet that avoid each pattern from a set P C £3 simul- 
taneously. Burstein and Mansour |BurMansT (resp. [BurMans2, BurMans3|) considered forbidden 
patterns (resp. generalized patterns) with repeated letters. 

The most attention, in the papers on classical or generalized patterns, is paid to counting exact 
formulas and/or generating functions for the number of words or permutations avoiding, or having 
k occurrences of, certain pattern. In this paper we suggest another problem, namely counting the 
number of occurrences of a particular pattern r in given words. We choose these words to be a 
set of all finite approximations (to be defined below) of a sequence generated by a morphism with 
certain restrictions. A motivation for such a choice is big interest in studying classes of sequences and 



words that are defined by iterative schemes [Lothairc, Salomaa]. The pattern r in our considerations 
is either a classical pattern from the set {1-2, 2-1, 1-1- • • • -1}, or an arbitrary generalized pattern 
without internal dashes, in which repetitions of letters are allowed. In particular, we find that there 
are (3 • 4™ _1 + 2") occurrences of the pattern 1-2 in the n-th finite approximation of the sequence w 
defined below, which is a classical example of a nonrepetitive sequence. 

Let E be an alphabet and E* be the set of all words of E. A map ip : E* — + E* is called a morphism, 
if we have tp(uv) — (p(u)ip(v) for any u, v G E*. It is easy to see that a morphism p can be defined by 
defining p(i) for each i G £. The set of all rules i — » ip(i) is called a substitution system. We create 
words by starting with a letter from the alphabet £ and iterating the substitution system. Such a 



substitution system is called a DOL (Deterministic, with no context Lindenmayer) system [LindRoz 



DOL systems are classical objects of formal language theory. They are interesting from mathematical 
point of view [Frid], but also have applications in theoretical biology Lind]. Let |A| denote the length 
of a word X, that is the number of letters in X. 

Suppose a word (p(a) begins with a for some a € E, and that the length of p k (a) increases without 
bound. The symbolic sequence lim >p k (a) is said to be generated by the morphism (p. In partic- 

k — >oo 

ular, lim <p k (a) is a fixed point of ip. However, in this paper we are only interesting in the finite 

k — >oo 

approximations of lim ip k (a), that is in the words <p (a) for k = 1, 2, . . .. 

k — >oo 

An example of a sequence generated by a morphism can be the following sequence w. We create 
words by starting with the letter 1 and iterating the substitution system <f> w : 1 — > 123, 2 — > 13, 3^2. 
Thus, the initial letters of w are 123132123213.... This sequence was constructed in connection with 
the problem of constructing a nonrepetitive sequence on a 3-letter alphabet, that is, a sequence that 
does not contain any subwords of the type XX — X 2 , where X is any non-empty word over a 3-letter 
alphabet. The sequence w has that property. The question of the existence of such a sequence, as 
well as the questions of the existence of sequences avoiding other kinds of repetitions, were studied 
in algebra [Adian, Justin, Kol|, discrete analys is JCarpi , Dckk, Evdok, Ker, Pleas | and in dynamical 
systems Morselled] ] . In Examples 2.2, 2.6 and 3.3 we give the number of occurrences of the patterns 
1-2, 2-1, 1-1- 12, 123 and 21 in the finite approximations of w. 
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To proceed further, we need the following definitions. Let N^(n) denote the number of occurrences of 
the pattern r in a word generated by some morphism 4> after n iterations. We say that an occurrence 
of r is external for a pair of words (X, Y), if this occurrence starts in X and ends in Y. Also, an 
occurrence of t for a word X is internal, if this occurrence starts and ends in this X. 

2. PATTERNS 1-2, 2-1 AND 1-1-.. .-1 

Theorem 2.1. Let A = {1, 2, . . . , k} be an alphabet, where k > 2 and a pattern r <E {1-2, 2-1}. Let 
X\ begins with the letter 1 and consists of £ copies of each letter i G A {£~> 1). Let a morphism <f> be 
such that 

l^Xx, 2^X 2 , 3 X 3 , . . . , k -f X k , 
where we allow Xi to be the empty word e for i = 2, 3, . . . , k (that is, <f> may be an erasing morphism), 

k 

\Xi \ = k ■ d, and each letter from A appears in the word X2X3 . . . Xk exactly d times. Besides, let 

1=2 

e-i.j (resp. ej) be the number of external occurrences of t for (Xi,Xj) (resp. (Xi,Xi)), where i =/= j. 
Let Si be the number of internal occurrences of t in Xi. In particular, Sj = ej = e^j — ejj = 0, 
whenever X, = e; also, e; = \Xi\ ■ (|Xj| — l)/2, whenever there are no repetitive letters in Xi. Then 
iV£(l) = Si and for n>2, N^(n) is given by 

(rf+r 2 E^+( (d+ 2 r2 )E e . + (-i+^r 4 £ 

i=l ^ ' i=l l<i<j<k 

Proof. We assume that r = 1-2. All the considerations for this r remain the same for the case t = 2-1. 
If n = 1 then the statement is trivial. 

Suppose n > 2. Using the fact that X1X2X3 . . . Xk, has exactly d + £ occurrences of each letter i, 
i = 1,2, . . . , k, one can prove by induction on n, that the word <fi n (l) is a permutation of (d + £) n ~ 2 
copies of each word Xi, where i = 1, 2, . . . , fc. This implies, in particular, that \4> n (l)\ = k ■ (d + £) n ~~ 1 . 

An occurrence of r in </>™(l) can be either internal, that is when r occurs inside a word Xi, or external, 
which means that r begins in a word Xi and ends in another word Xj. In the first of these cases, since 
there are (d + £) n ~ 2 copies of each Xi, we have (d + £) n ~ 2 J2i=i s » possibilities. In the second case, 
either i — j, which gives (j d+ ^ ) J2i=i e » possibilities, or i ^ j, in which case there are (d + £) n ~ 2 
possibilities to choose Xi (resp. Xj) among (d + £) n ~ 2 copies of Xi (resp. Xj), and using the fact 
that &ij = ej i (the order in which the words Xi and Xj occur in (j) n (l) is unimportant), we have 
(d + £) 2n ~ A £ 

i<i<i<fc e i ,j possibilities. Summing all the possibilities, we finish the proof. □ 

Let s (res p. e ) denote the vector (si, S2, • • • , Sfe) (resp. (ei, e 2 , ■ ■ ■ , ek)), whe re Si and ej are defined in 
Theorem |2.l| . All of the following examples are corollaries to Theorem [2.l[ 

Example 2.2. If we consider the morphism <f> w defined in Section [I] and the pattern r = 1-2 then 
d = I = 1, s = (3, 1, 0), e = (3, 1, 0) and e^a = £2,1 = 2, ei^ = e^^ — X, e 2 ,3 — e^, 2 — 1. Hence, the 
number of occurrences of t is given by N^~ 2 (\) = 3 and, for n > 2, N^~ 2 (n) = (3 ■ 4 ra_1 + 2")/2. If 
t = 2-1 then s = (0,0,0), e = (3,1,0) and = &2,i = 2, ei^ = 63,1 = 1, e 2 ,z = £3,2 = 1. Hence, 
iV?-i(i) = and, for n>2, N 2 -\n) = (3 • 4™" 1 - 2")/2. 
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Example 2.3. If we consider the morphism <f>: 1 — > 1324, 2 — > e, 3 — > 14, and 4 — > 23 then for the 
pattern r — 1-2, we have d = £ = 1, s = (5, 0, 1, 1), e = (6, 0, 1, 1), and e^j , /or i ^ j, are elements of 
the matrix 



- 





3 


3 













3 







2 


V 3 





2 





>n— 2 



ffence, A^" 2 (l) = 5 and, /or n > 2, A^~ 2 (n) = 3 • 4™" 1 + 11-2 

Example 2.4. 1/ we consider the morphism (j>: 1 — > 13542, 2 — » 423, 3 — > e, 4 — > 5115, and 5 —* 234 
</ien /or the pattern t — 1-2, we have £ = 1, d = 2, s = (6, 1, 0, 2, 3), e = (10, 3, 0, 4, 3), and e^j, /or 
i ^ j , are elements of the matrix 
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1-2-3 



Using the proof of Theorem ^lj, we have the following. 

Theorem 2.1 

the pattern r 



Theorem 2.5. Let a morphism <\> satisfy all the conditions in the statement of Theorem 2.1 and 
1-1- ■ • • -1. Then, for n > 2, the number of occurrences of r in <fi n (l) is given by 



r times 

k ■ (^ d+e ) ), whereas for n = 1, by k ■ . 



Proof. From the proof of Theorem 2.1, we have that if n > 2 (resp. n = 1) then 4> n (l) has exactly 

(d + (resp. i) copies of each letter from A. We can choose r of them in (^- d+e J ) (resp. (^)) 

ways to form the pattern r. The rest is clear. □ 



The following example is a corollary to Theorem 2.5. 

Example 2.6. If we consider the morphism 4> w defined in Section |I| and the pattern t = 1-1-1-1 then 
d = £ = 1, r = 4, hence the number of occurrences of t in <f> n (l) is 0, whenever n = 1 or n — 2, and 
3 • ( 2 4 ) othe 



terwise. 



3. Patterns without internal dashes 



In what follows we need to extend the notion of an external occurrence of a pattern. Suppose W — 
AXBYC, where A, X, B, Y and C are some subwords. We say that an occurrence of r in W is 
external for a pair of words (X,Y), if this occurrence starts in X, ends in Y and is allowed to have 
some of its letters in B. For instance, if W = 12324245, where A = 1, X = 23, B = 2 and Y = 424 
then an occurrence of the generalized pattern 213, namely the subword 324 is an external occurrence 
for (X,Y). 
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Theorem 3.1. Let A — {1,2,..., k} be an alphabet and a generalized pattern r has no internal 
dashes. Let X\ begins with the letter 1 and consists of £ copies of each letter i £ A (£ > 1). Let a 
morphism <p be such that 

1 -> Xl, 2 — > X 2 , 3 — > X 3 , . . . , k — > 
where we allow Xi to be the empty word e for i = 2, 3, . . . , A; (i/iai is, may 6e an erasing morphism), 

k 

|Xj| — k ■ d, and each letter from A appears in the word X2X3 . . . Xk exactly d times. Besides, we 

i=2 

assume that there are no external occurrences of t in <p n (l) for the pair (Xi, Xj) for each i and j . Let 
Si be the number of internal occurrences of t in X t . In particular, s, = 0, whenever X; t = e. Then 
JVJ(l) = 5i and for n>2, iVT(n) = (d+ £) n ~ 2 s i- 



Proof. The theorem is straightforward to prove by observing that for n > 2, <j> n (l) has (d + £) 



n-2 



occurrences of each word Xj (see the proof of Theorem 2.1). □ 



Remark 3.2. In order to use Theorem \3. i| , we need to control the absence of external occurrences 
of a pattern t for given t (without internal dashes) and a morphism (j). To do this, we need, for any 
pair (Xi,Xj), to consider all the words XiWXj, where \W\ < \t\ — 1, and W is a permutation of a 
number of words from the set {X±, X2, ■ ■ ■ , Xk}. 



The following examples are corollaries to Theorem 3.1. 



Example 3.3. // we consider the morphism cj) w defined in Section [I] and the pattern r = 12 then all 



the conditions of Theorems 3.1 hold. In this case d = £ = 1 and s = (2, 1,0). Hence, the number of 
occurrences of the patterns 12, that is the number of rises, is given by jV^ 2 (1) = 2 and, for n > 2, 
N^(n) = 3 • 2"~ 2 . Ifr = 123 then we can apply the theorem to get that for n>2, N^ 2 J(n) = 2 n ~ 2 . 

If we want to count the number of occurrences of the pattern t = 21, that is the number of descents, 



then we cannot apply Theorem 2.1, since for instance, the pair (X\,X2) — (123, 13) has an external 
occurrence of r. However, it is obvious that the number of descents in <j) n (l) is equal to |</> n (l)| — 
Nll(l) - 1 = 3 • 2"- 2 - 1. 

Example 3.4. // we consider the morphism <fi: 1 —> 1243, 2 — *■ 3, 3 — > e, and 4 — > 124 then for 



the pattern r = 123, all the conditions of Theorems 3.1 hold. In this case d = £ = 1, s = (1,0,0,1). 
Hence, for n > 1, N^ 23 (n) = 2™ . For r = 321 we cannot apply Theorem 3.1, since the pair (X4, X\) 
has an external occurrence of t (look at X 4^X2X1 = 12431243J. Consideration of the words X 4X2 and 
X4X1 implies that the theorem cannot be apply for the patterns 132 and 231 respectively. However, 
we can apply the theorem to the pattern 213 to prove that it does not occur in </> n (l) for any n. 
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